RSAT 2018: regulatory sequence analysis tools 20th anniversary
6 Pags.- 1 Tabl.- 1 Fig. © The Authors 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com ; RSAT (Regulatory Sequence Analysis Tools) is a suite of modular tools for the detection and the analysis of cis-regulatory elements in genome sequences. Its main applications are (i) motif discovery, including from genome-wide datasets like ChIP-seq/ATAC-seq, (ii) motif scanning, (iii) motif analysis (quality assessment, comparisons and clustering), (iv) analysis of regulatory variations, (v) comparative genomics. Six public servers jointly support 10 000 genomes from all kingdoms. Six novel or refactored programs have been added since the 2015 NAR Web Software Issue, including updated programs to analyse regulatory variants (retrieve-variation-seq, variation-scan, convert-variations), along with tools to extract sequences from a list of coordinates (retrieve-seq-bed), to select motifs from motif collections (retrieve-matrix), and to extract orthologs based on Ensembl Compara (get-orthologs-compara). Three use cases illustrate the integration of new and refactored tools to the suite. This Anniversary update gives a 20-year perspective on the software suite. RSAT is well-documented and available through Web sites, SOAP/WSDL (Simple Object Access Protocol/Web Services Description Language) web services, virtual machines and stand-alone programs at http://www.rsat.eu/. ; French Government implemented by RENABI-IFB program [ANR-11-INSB-0013] to N.T.T.N.; ANR [ANR-14-CE11-0006-02] to M.T.C. and D.T.; A.M.-R.'s laboratory is supported by a CONACYT grant [269449]; Programa de Apoyo a Proyectos de Investigación e Innovación Tecnológica – Universidad Nacional Autónoma de México (PAPIIT-UNAM) grant [IA206517]; M.T.-C., A.M.R and D.T. further acknowledge SEP-CONACYT – ECOS-ANUIES support. J.A.C.M. benefited from a PhD grant from the Ecole Doctorale des Sciences de la Vie et de la Sant´e, Aix-Marseille Université, and is supported by Norwegian Research Council [187615]; Helse Sør-Øst, and University of Oslo through the Centre for Molecular Medicine Norway (NCMM); B.C.M. was funded by Spanish MINECO [AGL2016-80967-R] and by Aix-Marseille Universit´e as Chercheur Invit´e in 2015; C.D.R.-E.'s laboratory is supported by a Wellcome Trust Seed Award [204562/Z/16/Z]; PAPIIT-UNAM grant [IA200318]; R.O. is supported by a PhD studentship from CONACYT. Funding for open access charge: Agence Nationale de la Recherche. ; Peer reviewed